ProPolyne: A Fast Wavelet-Based Algorithm for Progressive Evaluation of Polynomial Range-Sum Queries
نویسندگان
چکیده
Many range aggregate queries can be eÆciently derived from a class of fundamental queries: the polynomial range-sums. After demonstrating how any range-sum can be evaluated exactly in the wavelet domain, we introduce a novel pre-aggregation method called ProPolyne to evaluate arbitrary polynomial range-sums progressively. At each step of the computation, ProPolyne makes the best possible wavelet approximation of the submitted query. The result is a data-independent approximate query answering technique which uses data structures that can be maintained eÆciently and exactly. ProPolyne's performance as an exact algorithm is comparable to the best known MOLAP techniques. Experimental results show that this approach of approximating queries rather than compressing data produces consistent and superior approximate results when compared to typical wavelet-based data compression techniques.
منابع مشابه
WOLAP: Wavelet-Based Range Aggregate Query Processing
The Discrete Wavelet Transform has emerged as an elegant tool for data analysis queries. It was not until the time we proposed a new wavelet technique, ProPolyne, for fast exact, approximate, or progressive polynomial aggregate query processing that data did not have to be compressed, unlike most of the prior studies in this area. In this paper, after reviewing our ProPolyne technique in more d...
متن کاملFast Approximate Evaluation of OLAP Queries for Integrated Statistical Data
We have developed a mediator architecture that integrates statistical information about energy products from several government agencies, such as the Bureau of Labor Statistics, the Energy Information Administration, and the California Energy Commission. Our architecture has a dual mode of operation. First, our system can retrieve live data from databases and web sources from these agencies. Th...
متن کاملWavelet - Based Relative Prefix Sum Methods for Range Sum Queries in Data
Data mining and related applications often rely on extensive range sum queries and thus, it is important for these queries to scale well. Range sum queries in data cubes can be achieved in time O(1) using prefix sum aggregates but prefix sum update costs are proportional to the size of the data cube O ( nd ) . Using the Relative Prefix Sum (RPS) method, the update costs can be reduced to the ro...
متن کاملWavelet-based relative prefix sum methods for range sum queries in data cubes
Data mining and related applications often rely on extensive range sum queries and thus, it is important for these queries to scale well. Range sum queries in data cubes can be achieved in time O(1) using prefix sum aggregates but prefix sum update costs are proportional to the size of the data cube O ( nd ) . Using the Relative Prefix Sum (RPS) method, the update costs can be reduced to the ro...
متن کاملWavelet Disk Placement for E cient Querying of Large Multidimensional Data Sets
New data intensive applications operate on diverse types of data with new charac teristics in querying the data In particular the data set is large and multidimen sional popular examples are spatial and temporal data as well as sensor data streams the queries are complex asking for trends or outliers in data correlation between dif ferent dimensions or aggregation of one or more measure attribu...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002